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Abstract 

c/2 ■ The mathematical distinction between prose and verse may be 

detected in writings that are not apparently lineated, for example 
' in T. S. Eliot's Burnt Norton, and Jim Crace's Quarantine. In this 

^ ■ paper we offer comments on appropriate statistical methods for such 

work, and also on the nature of formal innovation in these two texts. 
Additional remarks are made on the roots of lineation as a metrical 
form, and on the prose-verse continuum. 



1 Introduction 

In Aoyama and Constable (1999) and Constable and Aoyama (1999) we 
have developed a technique, the Q n distribution, for detecting a mathe- 
matical distinction between prose and isometrically lineated text (verse) in 
English. For prose the Q n distribution is flat, for isometrically lineated 
verse it is peaked at the value of n which equals the number of syllables 
in the normative line. Our major focus in that work was on the theoret- 
ical value of such a distinction, while in this paper we will turn to look 
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more closely at a point raised before only in passing (see Constable and 
Aoyama (1999:515)), namely the possibility that the features we have iden- 
tified might be employed as a 'test' or diagnostic for lineation, enabling 
an analyst to detect the presence of mathematically lineated text where it 
is not visually represented on the page (syllabic verse printed as prose for 
example) or is concealed by the layout on the page, as for example when 
a sonnet is embedded in prose. We cautioned against the belief that our 
procedure would enable the detection of small passages of verse embedded 
in prose on the grounds that the effects of such lineation on the Q n dis- 
tribution would be so small that any peaks resulting would be swamped 
by random fluctuations in the rest of the distribution. However, we sug- 
gested that for longer texts the case might be different, and our aim here 
is to examine two works, T. S. Eliot's poem Four Quartets (1936-1943) and 
Jim Crace's novel Quarantine (1997) with a view to assessing the reliability 
of the procedure, and developing statistical techniques for determining the 
strength of inferences based on the computation of Q n distributions. 

In concluding we offer further comments on the experimental goals of 
these two texts, the roots of lineation as a metrical form, and the prose- verse 
continuum. 

2 Mathematical Preliminaries 

The essential feature of the syllabic structure of English prose found by 
Aoyama and Constable (1999) is random segmentation; by which we mean 
that the probability of having a word boundary after any syllable is constant. 
Mathematically, this implies that the probability of the syllable length of 
the k-th word, S k , being equal to S is given by a geometric distribution; 

P{S k = S} = q(l-q) S -\ (1) 

where q is the probability of the occurrence of a word boundary after any 
syllable, this latter probability being a parameter which varies between data 
sets. The reader might note that the right hand side of the probability ([!]) 
is independent from k, that is, the probability distribution does not depend 
on where the word is located in the whole article. 

This property of random segmentation leads to the finding that the 
expectation value of the number of syllables of the fc-th word is independent 
from k; 

E{S k } = s, (2) 
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where the mean syllable length s is given by s = 1/q, and the following 
results for the expectation value of the products of the syllable lengths of 
the k-th word and the k'-th word: 

E{(S k - E{S k })(S k , - E{S k ,})} = 6 kt „A, (3) 

where the variance A is given by A = (1 — q)/q 2 . 

The finding noted above was made mainly through studies of the Q n 
distribution, supplemented with studies of the correlations between word 
lengths. In Constable and Aoyama (1999), we found by examining the Q n 
distributions that isometrically lineated verse shows systematic deviation 
from the above properties. In investigating the properties of Quarantine 
and Four Quartets, we find it useful to utilize two other standard statistical 
tools, Fourier analysis and Correlation functions, in addition to the Q n 
distribution. In the following, we will define these three quantities to prepare 
for the analysis. 

2.1 Fourier analysis 

The Fourier component S m is defined by the following equation: 

S k = ^=j: S m e 2mmk / K , (4) 
v K m=0 

where K is the total number of words in the data set. The Fourier compo- 
nent can be directly calculated by the following inversion formula; 

s m = 4^J2 s ^' 2mmk/K - (5) 



k=l 

These Fourier components satisfy the following relations: 

S m +K — S m , (6) 

Sk-tu = Sm- ( 7 ) 

If the data set is randomly segmented, the expectation values for the 
Fourier coefficients S m satisfy the following equation: 

E{S m } = VKs5 mfi , E{\S m \ 2 } = A + ^ 6 m>0 , (8) 
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which can be proved by using Eqs.@ and (H). 

The Fourier analysis is sensitive to any periodic structure in the data; if 
there is a periodicity with a period of £ in the sequence {Si, S 2 , ■ ■ ■ , Sk}, the 
coefficient Sk/i ( or hs absolute value) will be large compared to the other 
Fourier coefficients. The degree of predominance depends on the strength 
of the periodicity: If the periodicity is weak the predominance of Si will be 
weak. 



2.2 The Correlation Function of Word Length 
We will now turn to a consideration of the correlation function: 
r = E{(S k -E{S k })(S k+e -E{S k+e })} 

i ~ E{(s k - E{s k m • [) 

where the subscript of S is defined by modulus K, i.e., Sk+i = S±, and so 
on. Since the value of K is typically of the order 10 3 ^> 1, this does not 
greatly affect the value of G(. From Eq.(^), it is evident that the randomly 
segmented data leads to the following: 

Ge = 4,fc' (10) 



2.3 The Q n distribution 

The probability Q n is defined as the probability that a sequence of adjacent 
words has the total number of syllables n. To be precise, let us define L n ^ k 
to be the number of occurrences that k sequential words have n syllables 
in total. From this definition, it is evident that the following identity is 
satisfied: 

oo 

Y,Ln,k = K. (fl) 
n=l 

This is because since there are K words in the data, there are K sequences. 
The Q n distribution is defined by the following: 

1 n 

Qn = JT ^2 Ln,k- (12) 
A k=l 

The upper limit of the sum in the above is induced by the property that 
L n ,k — for n < k, which follows from the fact that any English word is at 
least one-syllable long. 
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An alternative, but equivalent, definition of Q n is that it is the proba- 
bility that a word boundary occurs n-syllables after a word boundary. In 
this sense, Q n may be called the word-boundary correlation function. 

3 Jim Crace Quarantine 
3.1 Fourier analysis 




20,000 40.000 60,000 80,000 

m 



Figure 1: The real part (upper plot) and the imaginary part (lower 
plot) of the Fourier components S m for Quarantine for m = 1 to 
80,009. 

The plot of the Fourier coefficients S m for Quarantine (with K = 80, 010 
words) is given in Fig.[l]. Due to the identity (^), the real part of S m is 
symmetric around the middle-point m = K/2, while the imaginary part of 
S m is anti-symmetric around the middle-point. 

It is apparent that the real part of S m has a significant peak denoted by 
the letter A at the middle-point m = K/2, where Sk/2 — —2.662. No other 
structure is visible and the rest of the Fourier coeficients are consistent with 
the white noise characteristics (0) and @. In fact, the average number of 
syllables in this section is s = 1.303, which leads to v^A = 0.628. This 
humber is consistent with the plotted result, as is apparent in Fig.|2|. If the 
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Figure 2: The comparison between the Gaussian accumulated distri- 
bution ( |7^ ) and the behaviours of the actual Fourier coefficients of 
Quarantine. 



random segmentation is valid, the central limiting theorem dictates that 
the distribution of the Fourier coefficients S m for m^O should follow the 
Gaussian distribution, 



Pg(s) 



1 



cxp 



s 

2A 



(13) 



where s = Re(S m ),lm(S m ). In Fig.|^, the solid curve is the accumulated 
distribution, 

poo 

P(> S) EE / P G (s)ds, (14) 



with the above value of A. The dots are plots of the positive Re(S m ) versus 
its rank (the largest being 1, next 2, and so on) devided by the total number 
of the positive Re(5 m ). The open circles are similar plots for the negative 
Re(5 m ). Within statistical accuracy the dots and open circles should follow 
the solid curve if the Gaussian distribution applies. In Fig.|2| we see that 
almost all the points are close to the solid curve, except for the isolated point 
A at the far right, which is the peak A at m = K/2 in Fig .[I]. The fact that 
this point A is far above the theoretical curve suggests that it is very unlikely 
that this value of A is achieved simply by a statistical accident. This can be 
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explained as follows. At this point, s = 2.662, we find that the accumulated 
probability distribution is P(> s) = 2.037 x 10~ 9 , which means that the 
probability that a point exists beyond this value of s is 2.037 x 10~ 9 . On 
the other hand, there are 19,861 negative Re(5 m ) and the point A exist at 
this s. So the "measured probability" is 1/19,861 = 5.035 x 10~ 5 , which is 
the vertical coordinate of this point. Therefore, the existence of this point 
A simply by accident is quite unlikely (by the probability 4.045 x 10 -5 ). 
Putting this in another way, we may argue as follows: Since we have k = 
K/2 number of points, the expected number of points beyond this value of 
s is equal to P{> s) x (K/2) = 4.07 x 10~ 5 . This should be contrasted 
with the existence of a single point A, which leads to a conclusion that the 
existence of A by chance is almost impossible. This argument establishes 
that the peak A in Fig.[TJ is not an accident of statistical fluctuation, but is 
a real effect. 

The existence of the peak A shows that there is a strong period of 2 
in Quarantine. This is readily seen from the fact that the contribution of 
the m = K/2 term in the expression of the Fourier series (f|) is (— l) m 5jf/2- 
Since Sk/2 is negative, we find that throughout Quarantine, words in even 
positions (2nd, 4th, 6th, ■ • ■ words) tend to be shorter than those in odd 
positions. That is to say, there is a tendency for shorter and longer words 
to alternate in Crace's text. The relationship between this fact and the 
rhythmical patterning and consequent lineation noted below may be easily 
guessed at, but has yet to be examined in any scrupulous way. 

3.2 The Correlation Function 

The result is plotted in Fig.^|, where it will be seen that there is no apparent 
structure and the result is consistent with the prediction ( |i~0"D from the 
random segmentation hypothesis. The only deviation from ( |I~0"D is the slight 
deviation at i = 2. We find that this is qualitatively consistent with the 
analysis done for the Q2 dip in English prose by Aoyama and Constable 
(1999). 

3.3 The Q n distribution 

Fig.[| shows the Q n distribution for Quarantine. The peaks at n = 4, 6, 8, 
10, and so on, are sufficiently evident. It should also be noted that the value 
for n = 2 is not, as might be thought, inconsistent with these results. In 
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Figure 3: The correlation function Gi for Quarantine. 

fact a dip at n = 2 is normal in English prose (see Aoyama and Constable 
1999), and that found in Quarantine proves to be rather smaller than would 
otherwise be expected. 

In other words what appears to be a puzzling dip at n = 2, where one 
might naively expect a peak relating to those at multiples of two, is in fact 
a smaller than usual depression. The underlying causes of this reduced 
depression appear to be an enhancement of disyllables, as can be seen by 
comparing the word length distribution of Crace's text with both an ideal 
geometric distribution and the data from a large sample of English prose 
(See Fig||). Crace uses slightly more disyllables than would be predicted 
from the geometric distribution, whereas the prose corpus texts employ 
slightly less. 

4 T. S. Eliot Four Quartets 
4-1 Fourier coefficients 

We have found that the Fourier coefficients S m for any of the four sections 
have no significant structure, unlike Quarantine. They are all consistent 
with the white noise characteristics (0) and (H). 
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Figure 4 : The Q n distribution for Quarantine. 



4-2 The Correlation Function 

The result is plotted in Fig. |6], where it will be seen that there is no apparent 
structure. 

4-3 The Q n distribution 

Fig.[7] shows these Q n distributions for each of all four sections of The Four 
Quartets. As is apparent in this plot, Burnt Norton alone has a marked 
peak, at n = 4, a fact which suggests that a substantial part of it is com- 
posed in units of four syllables. 1 Of course due to the randomness of the 
original syllable distribution, one might observe a similar peak by chance. 
Some care is required in determining whether this peak is the result of 
authorial compositional ordering, or a simple accident. We can determine 
the statistical significance of the peak Q 4 by calculating the average (Q) 
and the standard deviation (erg) of Q n from n = 1 to 200 . For the Burnt 
Norton section we find the following values: 

1 N 

Q=mHQu = 0.6914, 

1 Similar peaks are observed for isometrically lineated verse. See Constable and 
Aoyama (1999). 
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Figure 5: The probability distributions of word length (dots) and the 
geometric distribution (solid line) for all of the prose data in Aoyama 
and Constable (1999) and Quarantine. The geometric distributions 
were defined so that the average number of syllables per word agrees 
with the actual distributions for each case. 



1 



N 



J2(Qn-Q) 2 = 0.007219, 



n=l 



where N = 200. The value of the Q4 in this section is, 

Q 4 = 0.7218 ~Q + 4.2 a Q , 



(15) 



(16) 



showing that this is a "4<r effect" . 

Another way to guage the significance of the Q4 peak is to make a plot 
similar to Fig.0. The probability of such a large deviation from the mean 
value Q is given by the accumulated probability P(> Q), which is defined 
as the probability that a value larger than or equal to Q is observed; 



P(>Q)= / P G (Q')dQ', 
Jq 

where Pq(Q) is the Gaussian distribution; 

(Q ~ Qf 



(17) 



Pg(Q) 



cxp 



2af 



For Q 4 = 0.7218, we find that P(> Q 4 ) = 1.3 x 10~ 5 . Straightforwardly, the 
probability of observing as large a peak as that for Q4 simply by accident 
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Figure 6: The correlation function Gi for each of the sections. 



is about 1.3 x 10 -5 . This situation is shown graphically in Fig|| which is 
similar to Fig.||. In this figure, the accumulated probability density P(> Q) 
defined in Eq. (|i7|) is shown by the dashed line and the actual distribution of 
Q n in this section by dots. The agreement between the continuous Gaussian 
distribution with the actual measured value is excellent, except for Q4. This 
both justifies the use of the Gaussian distribution above, and visualizes the 
abnormality of the Q4 peak. 

This result is consistent with the lack of any structure in the Fourier coef- 
ficients S n , since the latter are only sensitive to periodic structures involving 
the syllable numbers {Si, S 2 , ■ ■ ■ , Sk}, and would be ideal for studying such 
structures. That is to say, if the four syllable units appear in combination 
and in a periodic manner, the Fourier coefficients would make their existence 
evident. However, four syllable units, may appear in various combinations, 
of 1 and 3 syllables, 2+2, 1+1+2, and so on; furthermore, they may appear 
randomly in the section. Both of these facts would disrupt the periodic 
structure, and render the Fourier analysis useless. The same is true for the 
correlation function Gi . On the other hand, our word-boundary correlation 
function Q n is by definition sensitive to the existence of the four syllable 
units even in this situation. 
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Figure 7: The Q n plot of the four sections. 



5 Conclusion and Further Comments 

We have shown that the Q n computation is a sound and fairly sensitive 
register of one fundamental feature of lineated text, and complementary in 
some respects to two other statistical methods. In concluding what has 
been so far a methodological paper we will return to the language materials 
under consideration and offer several brief comments on the significance of 
the Q n peaks detected in the Four Quartets and Quarantine. 

5.1 T. S. Eliot, Four Quartets 

The metrical status of Eliot's Four Quartets has been much debated, notably 
in Cooper (1998). Q n analysis will not resolve all aspects of this debate, but 
does contribute significantly in regard to lineation. Our findings are that 
with the exception of Burnt Norton the Quartets are not mathematically 
lineated. In this we are in basic agreement with Cooper, who carried out an 
exhaustive rhythmical scansion. We supplement his remarks by observing 
that whilst the later three Quartets appear on reading to be to some degree 
rhythmically regular (and there are passages which are very obviously com- 
posed in isometric lines), Burnt Norton alone is mathematically lineated 
overall, the basic mathematical line being a segment of four syllables. It 
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Figure 8: The log-linear plot of the accumulated probability density 
Q) (dashed line) and the distribution of the measured Q n (n = 
1 ~ 1000) for the Burnt Norton section. The agreement with the 
Gaussian distribution is evident, except for Q^. 



seems likely that this results from a two beat duple segment, running as 
follows: offbeat, beat, offbeat, beat. It should be noted from the Q n dis- 
tribution that there is no subsequent peak at n = 8, suggesting that while 
these four syllable segments are frequent, they are not so frequently adja- 
cent as to produce an eight syllable mathematical line. This is surprising, 
and suggests deliberate avoidance on Eliot's part, as if he were unwilling to 
let his rhythms move too close to the familiar four beat octosyllabic unit. 

It seems possible that the poem as it survives traces an experiment 
with rhythmical patterning, and that Eliot's neglect of the 4 syllable line 
in the subsequent sections is a refinement of method, and the adoption of a 
revised compositional principle which permitted the production of the de- 
sired rhythmical effects without relying on clear echos of regular metrical 
structures. Burnt Norton is an early attempt to employ rhythmical metre 
in a form which is looser in its lineation structure than standard verse. As 
it happens it is still mathematically lineated. The subsequent three Quar- 
tets reveal further and more adventurous developments of this experiment. 
It would be interesting to know whether this anticipates in any way the 
character of the metre employed in Eliot's later plays, and we note this as 
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being a topic for future research. 
5.2 Jim Crace, Quarantine 

Crace's writing, certainly the more recent books, has been very widely re- 
garded as of a type differing from standard prose. The critic Frank Kermode 
has noted of Quarantine that it is from 'the end of the fiction spectrum where 
the novel is most like a poem, most turned in on itself, most closely wrought 
for the sake of art and internal cohesion' (Kermode 1998). John Banville, 
discussing another of Crace's novels, Being Dead, has even pointed out that 
technically the prose often uses verse fragments, and that much of this book 
is 'written in a kind of broken blank verse, and indeed could be successfully 
laid out in verse form', adding that Crace is 'particularly fond of iambic 
pentameter' (Banville 2000). 

Certainly, Quarantine is readily recognized as being rhythmically more 
regular than ordinary prose, but there are no visually salient lines, and even 
careful examination during reading fails to find any convincing or consis- 
tent division. Although it is possible, as Banville says of Being Dead, to find 
occasional lines, even short sequences, it is difficult to successfully display 
any sizeable piece of the text as isometric verse. Casual examination can 
make no further progress. However, the Q n procedure is sufficiently sensi- 
tive to detect lineation even when distributed in small and isolated packets, 
and of Quarantine it reveals that syllabic groups of two, four, six (which is, 
curiously, less prominent), eight, ten, and subsequent multiples of two are 
significantly more common than they would be in standard prose. Quaran- 
tine is non-randomly segmented, and even though it does not employ a core 
isometric line length, and its 'lines' do not follow on one from another, it is 
still, and in a novel and important sense, lineated. 

The fact that there is no particular preference for one line length over 
another, that is no preference, say, for decasyllabic groups over octosyllabic 
groups, is of very great interest, and suggests that Crace did not set out to 
write in lines and then to conceal them by printing his work with a prose 
layout. Instead we suspect that the lineation detected by the Q n proce- 
dure arises as a byproduct of a deliberate organisation of the rhythmical 
patterning of the text. 

The character of this general organisation can be inferred from the Q 2 
data. As noted above (see Section 3.3), although apparently anomalous, 
because it seems to register a depression rather than a peak, this data 
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point is in fact consistent with the subsequent peaks at multiples of two, 
since the depression in Crace's text is actually smaller than expected. The 
fundamental cause of this, as noted in our discussion above, is simply that 
Crace has employed slightly more disyllabic words than would be expected 
in standard prose, and thus we may conclude that lineation arises in Crace's 
text from a predisposition to employ disyllabic words so as to facilitate the 
construction of alternating rhythms. 

As far as we know Crace has not commented in public on his reasons 
for employing this structuring. If asked he would perhaps reply, taking up 
terms similar to those used by Kermode above, that he felt it gave his work 
a self-supporting and self-consciously artistic structural brace that unpat- 
terned prose did not possess. While this might be a satisfactory proximal 
psychological explanation, we suspect that a deeper account of the resulting 
experiences for both reader and writer is available and may more adequately 
account for the appeal of this formal device. Anyone who has read Crace's 
work will agree with John Updike that it has a 'hallucinatory' quality (Up- 
dike 1999). That this is not readily accounted for by reference to the facts 
of the narrative is worth remark, and is a quality which recalls the character 
of much of the most successful poetry. 

The exact consequences of the variety of mathematical lineation found 
in Quarantine are yet to be fully understood, but it is so far clear that at 
least some of the previous restrictions noted in regard to isometrically lin- 
eated text also apply to Crace's composition. The peaked Q n distribution 
indicates that some degree of syntactic distortion occurs, and we have seen 
above that dictional distortion is present in the enhancement of disyllabic 
words. It is also possible that even though he was not composing deliber- 
ately in a single isometrical line, but varying his segment length, that some 
reduction in mean word length has taken place, in order to increase the 
syntactical options (a known consequence of isometric lineation: see Con- 
stable and Aoyama 1999). Both these distortions have been discussed by 
Constable (1998) within his Disruption Theory, which proposes that read- 
ers attempting to process text which is evenly and subtly disrupted can 
neither reject it as damaged nor successfully interpret it, and consequently 
experience an illusion of profundity (see also the sketch in Constable and 
Aoyama (1999)). However, it seems almost certain that with the variety of 
lineation employed in Crace's Quarantine these disruptions are less marked. 
However, it should be noted that a further source of disruption, that caused 
by rhythmical patterning, may well be present, namely an alteration in the 
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frequency of stressed and unstressed syllables leading to an increase in the 
frequency of content terms. This disruption is also found in standard du- 
ple verse, but in Crace's text it may be the principal source of disruption, 
with syntactical and word length distortions playing a much smaller part. 
The tendency to use more disyllables is relevant here, though its dictional 
effects, on the frequency of the various parts of speech for example, are at 
present unknown. 

On such a view the text is experimental in the sense that it varies the 
proportions of the disruptive influences in order to produce new experiences 
for readers. It might be seen as an attempt to provide prose with some of 
the richness of effect familiar from isometric verse without suffering to the 
same degree its disablements, and should be compared with experimental 
works such as Eliot's Four Quartets which approach the problem from the 
other side, and beginning from a verse base move towards the freedom of 
prose without losing the benefits of disruption (see Constable (1998) for a 
discussion of the decline of isometric verse and the rise of prose in terms of 
disruption theory). 

5.3 The Roots of Lineation and Formal Innovation 

Comparison of these two texts sheds light on the relationship between math- 
ematical lineation and rhythmical organisation. In one case, Four Quartets, 
we have rhythmical organisation with and without mathematical lineation, 
whereas in Quarantine we find both mathematical lineation and rhythmical 
organisation, but with lineation only appearing as an unconscious byprod- 
uct of the attempt to create regular rhythmical effects. 

This may shed some light on the roots of visible and salient lineation as 
a metrical rule, since we can see that even when an author is not aware of 
lineation as a constraint, which is arguably the case in preliterate composers, 
lineation may arise as a byproduct of rhythmical regularity. What we have 
in Crace is some evidence suggesting that lineation is a likely, indeed a very 
probable outcome of rhythmical organisation. We hypothesize that the 
roots of lineation as a metrical restriction are deep, at least in English, and 
that mathematical lineation may be coeval with rhythmical patterning. Its 
extraction and development into a separately understood metrical object, 
the line, is likely to have come much later. What is particularly interesting is 
that while lineation should have been recognized as independent of rhythm 
(i.e. in syllabic verse), there has been no parallel recognition of regular 
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rhythmical patterning without isometric lineation. 

But, and this is the most interesting point arising from Eliot's texts, 
lineation would appear to be unnecessary to some degree of rhythmical or- 
ganisation, and with care it seems possible that it may be avoided. We 
speculate that Eliot's experimental work in Four Quartets is not an isolated 
example, and that others may be found in Seventeenth Century Drama, par- 
ticularly in Shakespeare. Nevertheless, even if such antecedents are found 
it seems likely that regularly rhythmical but unlineated text is a largely 
unexplored technical possibility, one which does not come about by chance 
with any great frequency and one which is difficult to achieve deliberately, 
certainly without detailed knowledge. It is also possible that given other 
features of English, namely those relating to word length, part of speech 
frequency, and stress, that mathematical lineation is an all but unavoidable 
consequence of definite rhythmical patterning if noticeably unnatural out- 
put is to be avoided. Further theoretical work is expected to permit more 
conclusive comments on this matter. 

Interesting though Quarantine is, our tests show that there is still room 
for further experimentation in this vein, and the abstract understanding of 
lineation offered in this paper and our previous papers, may go some way to 
assisting writers in devising composition methods which further minimize 
lineation. The psychological effects of text of this type are quite unknown, 
but it is conceivable that a diction somewhat distorted by the construction 
of rhythmical regularity would combine with very well with a syntax undis- 
rupted by non-random word length ordering, and perhaps produce a formal 
constraint as vigorous and attractive to readers as isometric verse once was. 

5.4 The Prose-Verse Continuum 

The characters of Eliot's Quartets and Crace's Quarantine as revealed here 
both act as an incitement to reconceive the relationship between verse and 
prose. Colloquial and academic literary critical discussion, has generally 
treated 'verse' and 'prose' as intuitively obvious exclusive categories sepa- 
rated by an ill-defined grey area. This essentialist tendency has been en- 
couraged by the binary leanings of generative metrics, and our own work on 
the computable distinction between prose and verse might even be seen as 
reinforcing this habit. However, there is nothing in the facts of the matter 
which obliges us to conceive of prose and verse in this rigid way, and we 
believe that careful consideration of our theoretical research will show that 
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there are very good reasons for taking up an alternative conception, namely 
that what we think of as typical verse and typical prose are distinguishable 
clusters of texts positioned on a continuum. 

Underlying such a view is the fundamental insight that while isometric 
verse involves an ordering of word length this is not a feature which is 
either absolutely present or absent, but one that varies in degree between 
texts. Thus we may say that there is a continuum between random disorder 
of word length on one side and maximal order on the other. From an 
abstract theoretical perspective we can see that any particular text may 
lie at some point along this continuum, but practically speaking there are 
considerable difficulties in the way of assigning it to a precise position. 
However, this need not deter us from recognizing the utility of the continuum 
as a conception, and in this light the difficulties of text assignment appear 
as technical challenges rather than as obstacles to accepting the continuum 
in the first place. 

A key issue in addressing this matter will be a refined understanding of 
word length ordering. All the prose texts we have so far studied in fact ex- 
hibit a very small degree of order in the fine structure of the word length pro- 
file (see Aoyama and Constable (1999), and Constable and Aoyama (1999)), 
and so seem to lie at some distance from the theoretical prose pole, while at 
the other extreme no text so far examined is without randomness in its word 
length characteristics, and thus lies at some distance from the theoretical 
verse pole. In between we can be fairly confident that some texts we have 
studied lie closer to the theoretical maximum of order, such as Longfellow's 
Hiawatha, which has a very marked repeating structure in its Q n , while 
others seem to lie closer to the disordered pole, such as most blank verse 
by Wordsworth, whose normative line length peaks are modest in size and 
whose multiple peaks diminish rapidly as a result of weak long range cor- 
relation (i.e. high frequency of variant lines). The location of a text such 
as Crace's Quarantine is more difficult to determine. As noted above, while 
it exhibits a repeating structure it has no major peak (i.e. no predominant 
line length), but instead a whole series of minor peaks. At present we are 
not in a position to distinguish definitively between the degree of order in 
such a text and that in, for example, a fundamentally isometric text such as 
Wordsworth's Prelude. It is tempting to suppose that Quarantine must lie 
closer to the prose end of the continuum, on account of its lack of concen- 
trated order, but this, though convenient, may be no more than a residual 
effect of the exclusive categories that the conception of the continuum was 
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introduced to replace. Satisfactory resolution of this matter will not be 
possible until some better method has been determined for remarking upon 
the degrees of order and disorder in a text's lineation structure, and it may 
be noted here in passing that the Fourier analysis will probably of use in 
this regard. 
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